ABSTRACT: We present a measurement of the charged particle reconstruction inefficiency inside
jet cores, using pp collision data produced at the LHC at a center-of-mass energy of 13 TeV and
collected by the ATLAS experiment in 2015. The determination of this inefficiency is important for
jet energy scale and mass calibration, as well as for many other performance studies and analyses.
A data-driven method is used, in which the fraction of lost particle tracks is determined from the
energy deposition dE/dx in the pixel detector. The fraction of lost tracks is found to be less than 5%,
an improvement over the previous study, and agrees with Monte Carlo simulation within systematic
uncertainties.
1 Introduction


The performance of charged particle track reconstruction in dense environments, such as the core
of high-pT jets, is important for many analyses and performance studies. Examples include
b-tagging and boosted τ reconstruction, jet energy scale and mass determination, and analyses using
jet substructure information. In many cases the leading source of systematic uncertainty is the
uncertainty on the track reconstruction efficiency. It is therefore necessary to maintain a high
tracking efficiency inside jets as well, and to determine the (in-)efficiency precisely.

Tracks are reconstructed from hit clusters in the inner detector. Clusters shared between multiple
tracks are penalized during reconstruction to ensure high-quality tracks, as one of the tracks in
this case is likely to be fake. In dense environments, however, this is a disadvantage, since clusters
from close-by tracks can naturally merge. To account for this, an artificial neural network is trained
to identify such merged clusters so that they are not penalized [1, 2]. This greatly increases the
percentage of correct associations of clusters to tracks at small track separations, and improves
b-tagging and τ reconstruction performance. To determine the remaining tracking inefficiency in dense
environments, a data-driven method is applied, in which the fraction of lost tracks is determined from
the energy deposition dE/dx in the pixel detector (section 4).

The ATLAS [3] pixel detector is part of the inner detector system, together with the semiconductor
tracker (SCT) and the transition radiation tracker (TRT). The pixel detector is built in a
barrel and disc geometry, and has a pseudorapidity coverage¹ of |η| < 2.5. It is built mostly from
planar silicon pixel modules. The insertable B-Layer (IBL), which was added during a long shutdown
(2013-2015), includes planar and 3D sensors [4]. There are four barrels, including the IBL,
situated at r = 33.2, 50.5, 88.5 and 122.5 mm. The forward region is instrumented with three disk


¹ ATLAS uses a right-handed coordinate system, centered on the nominal interaction point (IP) at the center of
the detector. The x axis points towards the center of the LHC ring, the y axis points upwards, and the z axis runs
parallel to the proton beams. In the transverse plane, cylindrical coordinates (r, φ) are used, where φ is the
azimuthal angle around the z axis. Instead of the polar angle θ, the pseudorapidity η = −ln tan(θ/2) is often used.


pairs at z = ±495, 580 and 650 mm. Sensor pixels are typically 50 μm in the transverse direction and
400 μm in the longitudinal direction, whereas pixels of the IBL are only 250 μm longitudinally.

A measure of energy deposition dE/dx is given by the pixel detector time-over-threshold 
(ToT). This is the time that a pulse, caused by a particle, spends over a given threshold, and is 
approximately proportional to the collected charge. In particle reconstruction, pixels are grouped 
by a clustering algorithm into clusters. The dE/dx value of a cluster is determined by the total 
collected charge. 
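
As a rough illustration of how a cluster dE/dx value could follow from ToT, the sketch below sums the charge of a cluster and divides the deposited energy by the traversed material. It is not the ATLAS calibration: the linear ToT-to-charge factor is hypothetical, and the silicon constants, the 250 μm sensor thickness and normal incidence are assumptions not taken from the text.

```python
# Assumed constants (not from the text): silicon properties and sensor thickness.
EV_PER_EH_PAIR = 3.6          # eV per electron-hole pair in silicon (approximate)
SILICON_DENSITY = 2.33        # g/cm^3
SENSOR_THICKNESS_CM = 0.025   # 250 um, assumed planar sensor thickness


def tot_to_charge(tot, electrons_per_tot=1000.0):
    """Hypothetical linear ToT-to-charge calibration (electrons per ToT count)."""
    return tot * electrons_per_tot


def cluster_dedx(pixel_tots, path_length_cm=SENSOR_THICKNESS_CM):
    """Return dE/dx in MeV g^-1 cm^2 from the summed charge of one cluster.

    path_length_cm defaults to the sensor thickness, i.e. normal incidence.
    """
    total_charge = sum(tot_to_charge(t) for t in pixel_tots)   # electrons
    energy_mev = total_charge * EV_PER_EH_PAIR * 1e-6          # eV -> MeV
    return energy_mev / (path_length_cm * SILICON_DENSITY)


# Toy two-pixel cluster: 12 + 8 ToT counts = 20000 electrons in this calibration
print(round(cluster_dedx([12, 8]), 3))
```

For an inclined track the path length through the sensor is longer than the thickness, so a realistic version would pass the actual path length instead of the default.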

Since the magnetic field of the detector bends particle trajectories apart as they travel outward
through the detector, nearby clusters are more likely to merge close to the interaction point. However,
the IBL only encodes ToT information in 4 bits, whereas the next layer, the B-Layer, uses 8 bits and
provides a better ToT resolution. For that reason, the dE/dx information from clusters in the
B-Layer is used in the following.


2 Samples 


For this analysis, data samples recorded by the ATLAS detector in 2015 (Run II) of proton-proton
collisions produced by the LHC at √s = 13 TeV were used, corresponding to an integrated
luminosity of 2.8 fb⁻¹. Events were selected if they passed single-jet triggers, with a minimum jet pT
threshold of 100 GeV. The triggers were subject to a prescaling depending on the instantaneous
luminosity and the energy of the jet triggered on. This suppresses low-pT jets, while keeping all events
that include a jet with pT > 1 TeV, leading to a more uniform transverse momentum spectrum.
Events were required to pass standard data quality requirements, and to contain at least one
reconstructed primary vertex associated with at least three tracks.

Data are compared with a Monte Carlo simulation generated with Pythia 8.186 [5]. Generator
parameters were set according to the A14 tune for parton showering and hadronization, and parton
distribution functions (PDF) were taken from NNPDF23LO [6]. For comparison, samples were
also generated using Herwig++ 2.7.1 [7] with the UEEE5 tune and the CTEQ6L1 PDF set [9],
as well as Sherpa 2.1 [8] using CT10 PDFs [10]. Events are digitized using a GEANT4-based
simulation of the ATLAS detector, and then reconstructed using the same reconstruction algorithms
as used for data. Monte Carlo events are finally reweighted to match the number of events triggered
on in data.


3 Object Selection 


Jets were seeded from topological clusters [11] and reconstructed with the anti-kt algorithm [12]
using a radius parameter of R = 0.4. They were required to have a transverse momentum of
pT^jet > 200 GeV and to lie in the region |η^jet| < 2.5. Jets have been calibrated to the hadronic jet
energy scale using a calibration derived from Monte Carlo [13]. It has been shown previously that
simulated jet properties agree well with data [14].

Tracks are reconstructed using an iterative algorithm. They are seeded using combined
measurements from the silicon detectors, and reconstructed using a combinatorial Kalman filter
together with a stringent ambiguity solver [15, 16]. The following cuts are applied to tracks:

pT^trk > 10 GeV

|η^trk| < 1.2

|d0^BL| < 1.5 mm, where d0^BL is the transverse impact parameter w.r.t. the beamline position

|z0^BL sin θ| < 1.5 mm, where z0^BL is the distance in z direction between the track's point of
closest approach and the primary vertex, and θ is the polar angle of the track at this point

Number of SCT hits > 6

Number of pixel holes² < 1

Figure 1. Definition of template and data distributions. This and all further figures are taken from [17].
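
The cuts above map naturally onto a predicate over track records. The following is a minimal sketch, not ATLAS software: the `Track` record and its field names are hypothetical, the impact-parameter cuts are assumed to act on absolute values, and the hit and hole requirements use the inequalities exactly as printed in the list.

```python
import math
from dataclasses import dataclass


@dataclass
class Track:
    """Hypothetical track record; field names are illustrative only."""
    pt: float            # transverse momentum, GeV
    eta: float           # pseudorapidity
    d0_bl: float         # transverse impact parameter w.r.t. the beamline, mm
    z0_bl: float         # z distance of closest approach to the primary vertex, mm
    theta: float         # polar angle at the point of closest approach, rad
    n_sct_hits: int
    n_pixel_holes: int


def passes_selection(trk: Track) -> bool:
    """Apply the listed track cuts (absolute values assumed for d0 and z0*sin(theta))."""
    return (
        trk.pt > 10.0
        and abs(trk.eta) < 1.2
        and abs(trk.d0_bl) < 1.5
        and abs(trk.z0_bl * math.sin(trk.theta)) < 1.5
        and trk.n_sct_hits > 6        # as printed in the list
        and trk.n_pixel_holes < 1     # as printed in the list
    )


central = Track(pt=25.0, eta=0.3, d0_bl=0.2, z0_bl=0.5,
                theta=math.pi / 2, n_sct_hits=8, n_pixel_holes=0)
print(passes_selection(central))
```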


4 Template Fit Method 


The goal of this method is to determine the fraction of tracks lost due to merged clusters in jet
cores. The energy deposition dE/dx of pixel clusters follows a Landau distribution [18], assuming
the material is sufficiently thin and each cluster is hit by only a single particle. The peak of the
distribution is around the minimum ionizing particle (MIP) energy. When two particles contribute
to the same cluster, a second peak at twice the MIP energy is visible. A third, weaker peak can appear
for three particles hitting the same cluster.

When a cluster is assigned to only one reconstructed track (not multiply used), two situations
are possible: the cluster was indeed hit by only one particle, or it was hit by one reconstructed
particle and another particle that was missed. It is impossible to distinguish the two situations on a
per-cluster basis, but one can determine statistically how often each situation occurs by comparing
the two peaks in the dE/dx distribution. From this, the probability that a track is lost due to merging
can be computed.

To determine the individual contributions of the two cases, a template fit is used. The data to be
fitted is the dE/dx distribution of not multiply used clusters in the core of jets (angular separation³


² A pixel hole is defined as an expected hit, where the reconstructed track crosses the detector surface, that is not
recorded. Inactive parts such as sensor edges or disabled modules are excluded from the definition and do not create
holes.


³ Angular distance is given by ΔR = √((Δφ)² + (Δη)²).
Figure 2. Left: Single and multiple track templates, derived from data. Right: Energy loss dE/dx
distribution from pixel clusters in jet cores, to be fit with the templates. From [17].
Figure 3. Result of the template fit for data, for values of jet pT = 200-400 GeV (left) and
pT = 1000-1200 GeV (right). From [17].


between center of jet and track ΔR(trk, jet) < 0.05). To get a sample that is enriched in clusters
hit by single tracks (single-track template), a selection outside of the jet core (ΔR(trk, jet) > 0.1)
is applied. An enriched multiple-track template is obtained by staying in the jet core, but using
multiply-used clusters instead (that is, clusters that have multiple confirmed reconstructed tracks).
The selection is summarized in figure 1. Data is separated into seven jet pT bins spanning
200-1600 GeV. Since the available statistics are low at high jet pT, the templates derived at
jet pT = 200-400 GeV are used to fit all distributions in data. The single- and multiple-track
templates, and a data distribution, can be seen in figure 2. To minimize the influence of clusters
which were hit by three tracks, the fit was performed in a reduced region of 0.8-3.2 MeV g⁻¹ cm² for
MC, and 0.67-3.07 MeV g⁻¹ cm² for data. The regions were chosen such that the fraction of all
clusters they contain is the same in data as in MC.
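
The sample definitions above can be sketched as a small classification step. This is an illustration under stated assumptions rather than the analysis code: the function names and labels are hypothetical, and the single-track template is assumed to use not multiply used clusters, as the data distribution does.

```python
import math


def delta_r(eta1, phi1, eta2, phi2):
    """Angular distance, with the azimuthal difference wrapped into [-pi, pi)."""
    dphi = (phi1 - phi2 + math.pi) % (2.0 * math.pi) - math.pi
    return math.hypot(eta1 - eta2, dphi)


def categorize_cluster(dr_trk_jet, n_tracks_on_cluster):
    """Return the distribution a cluster enters, or None if it enters none."""
    multiply_used = n_tracks_on_cluster > 1
    if dr_trk_jet < 0.05:                         # jet core
        return "multiple_track_template" if multiply_used else "data"
    if dr_trk_jet > 0.1 and not multiply_used:    # outside the core (assumed: singly used)
        return "single_track_template"
    return None                                   # 0.05 <= dR <= 0.1, or other cases


dr = delta_r(0.40, 3.10, 0.42, -3.15)   # track and jet on either side of phi = +-pi
print(categorize_cluster(dr, n_tracks_on_cluster=1))
```

Wrapping the azimuthal difference matters for tracks and jets close to φ = ±π, where a naive subtraction would give a separation near 2π.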

The fraction of lost tracks F_lost is given directly by the fit fraction of the multiple-track
template. The fit result for data can be seen in figure 3 for two different jet pT bins. Similar plots for
Monte Carlo simulation are shown in [17]. The fraction of lost tracks as a function of jet pT is shown
in figure 4, for data and simulation. The loss fraction increases with jet pT, and data and simulation
agree over the whole range within the systematic uncertainties outlined in section 5.
The central values of data and simulation differ by approximately 25% [17].
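
To make the idea of a two-template fit concrete, the sketch below estimates the multiple-track fraction from binned histograms restricted to the fit region. It uses an unweighted least-squares estimate with a closed-form solution, which is a simplification chosen for brevity and not necessarily the fit procedure of [17]; the histograms are made-up toy numbers.

```python
def normalize(hist):
    """Scale a histogram to unit area."""
    total = float(sum(hist))
    return [h / total for h in hist]


def fit_multiple_track_fraction(data, single, multiple):
    """Least-squares estimate of f in: data ~ (1 - f) * single + f * multiple.

    All inputs are histograms over the same bins; each is normalized first.
    """
    d, s, m = normalize(data), normalize(single), normalize(multiple)
    num = sum((di - si) * (mi - si) for di, si, mi in zip(d, s, m))
    den = sum((mi - si) ** 2 for si, mi in zip(s, m))
    f = num / den
    return min(max(f, 0.0), 1.0)   # clamp to the physical range [0, 1]


# Toy templates (made-up): single-track peak at low dE/dx, multiple-track peak higher
single = [10, 60, 25, 4, 1, 0]
multiple = [0, 2, 10, 40, 38, 10]

# Pseudo-data built with a 4% multiple-track admixture
s_n, m_n = normalize(single), normalize(multiple)
pseudo_data = [0.96 * si + 0.04 * mi for si, mi in zip(s_n, m_n)]

f_lost = fit_multiple_track_fraction(pseudo_data, single, multiple)
print(round(f_lost, 3))
```

A realistic fit would account for the statistical uncertainty of each bin, in both the data and the templates.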
Figure 4. Fraction of lost tracks due to merged clusters, determined in data and simulation for varying
values of jet pT. Shaded areas show total uncertainty including systematic uncertainties as described in section 5.
From [17].


5 Systematic Uncertainties 


The systematic uncertainties for simulation are dominated by Monte Carlo generator differences.
This uncertainty has been evaluated by comparing the fit results from Pythia 8, Sherpa and
Herwig++ samples. The relative systematic uncertainty ranges from 41% at 200-400 GeV to 5% at
1000-1200 GeV. For details, see [17].

An additional uncertainty comes from the choice of fit region. It was found that varying the
upper edge of the region changes F_lost, although the change is significant only in data. A systematic
uncertainty equal to the maximal change in F_lost has been applied in each pT bin, varying between 12%
and 25%.
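
The fit-region uncertainty described above amounts to taking, per pT bin, the largest change of the fitted fraction over the tested variations. A minimal sketch with made-up numbers follows; expressing the result relative to the nominal value is an assumption, as is the function name.

```python
def fit_region_systematic(f_nominal, f_varied):
    """Relative systematic: largest absolute change in F_lost over the variations."""
    max_change = max(abs(f - f_nominal) for f in f_varied)
    return max_change / f_nominal


# Made-up values for one pT bin: nominal fit and three upper-edge variations
rel = fit_region_systematic(0.040, [0.042, 0.036, 0.045])
print(f"{rel:.1%}")
```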

In data, an uncertainty results from using low jet pT templates to fit high jet pT data. A check has
been carried out with a high-statistics simulation, and it was found that the fraction of clusters
with three contributing tracks varies as a function of pT. This leads to a small bias in the resulting
value of F_lost, which has been taken into account as a systematic uncertainty. The size of this
uncertainty is between 11% and 17%.


6 Conclusions 


The tracking inefficiency in jet cores has been determined using measurements of energy deposition
in the ATLAS pixel detector, on √s = 13 TeV LHC data taken in 2015. It was found that the
fraction of lost tracks due to cluster merging is between 1% and 5% for jet pT = 200-1600 GeV. The
data and simulation are found to agree within 25% in the investigated jet pT range.